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(57) Abstract 

The present invention relates to a selection 
method that allows fast recovery and identification 
of functional gene fragments which selectively in- 
hibit growth, e.g., are cytostatic or cytotoxic, of 
particular cell-types, such as transformed cells. 
The strategy relies, in part, on the ability of small 
gene fragments to encode dominant-acting syn- 
thetic genetic elements (SGEs), e.g., molecules 
which interfere with the function of genes from 
which they are derived. SGEs which can be iden- 
tified by the subject method include, but are not 
limited to, inhibitory antisense RNA molecules, ri- 
bozymes, nucleic acid decoys, and small peptides. 
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Methods and Reagents for Identifying Synthetic Genetic Elements 



RarU ground of the Invention 

Functional inactivation of genes through the expression of synthetic genetic elements 
5 comprising all or a part of the gene to be inactivated is known in the art. At least four 
mechanisms exist by which expression of such specific genetic elements can result in 
inactivation of their corresponding gene. These are interference with protein function by 
polypeptides comprising nonfunctional or partly nonfunctional analogs of the protein to be 
inhibited or a portion thereof, interference with mRNA translation by complementary anti- 
10 sense RNA or DNA, destruction of mRNA by anti-sense RNA coupled with ribozymes, and 
interference with mRNA by RNA sequences homologous to a portion of the mRNA 
representing an important regulatory sequence. 

Although gene suppression is quite useful for scientific studies of gene function and 
holds considerable promise for certain applications in disease therapy and genetic modification 
15 of plants and animals, current methods for identifying effective synthetic genetic elements 
(SGEs) are time consuming and arduous. Interference by dominant negative mutant proteins, 
for example, either requires extensive knowledge about the functional domain structure of the 
protein so that reasonably promising candidate mutant proteins can be prepared, or necessitates 
individual preparation and screening of numerous candidate mutant proteins. Antisense RNA 
20 and competitive homologous RNA similarly require extensive individual preparation and 
• screening of candidate inhibitory sequences, absent considerable knowledge about important 
specific sequences within the RNA. 

There is, therefore, a need for generalized methods for identifying and isolating SGEs 
that will allow simplified determination of effective elements without undue experimentation 
25 or extensive structure/function knowledge. An ideal method would allow simultaneous 
analysis of multiple possible candidate SGEs, regardless of their mechanism of action. 



30 



Brief Summary of the Invention 

The present invention facilitates drug discovery by providing a method for identifying 
agents that selectively confer a desired phenotype on a target cell, comprising: 
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(i) transfecting subtractive cells with a library of expression vectors comprising a 
variegated population of coding sequences for potential synthetic genetic elements 
(SGEs); 

(ii) isolating those SGE vectors of the SGE library that do confer the desired phenotype, or 
5 a phenotype that interferes with the detection thereof, to the subtractive cells; 

(iii) transfecting target cells with the sub-population of SGE vectors isolated in step (ii); 
and 

(iv) isolating those SGE vectors that confer the desired phenotype to the target cell. 

One aspect of the subject method relates to a method for identifying agents with 
10 selective antiproliferative activity for a target cell, comprising: 

(i) transfecting subtractive cells with a library of expression vectors comprising a 
variegated population of coding sequences for potential synthetic genetic 
elements (SGEs); 

(ii) isolating those SGE vectors of the SGE library that are not lethal to the 
15 subtractive cells; 

(iii) transfecting target cells with the non-lethal SGE vectors isolated in step (ii); and 

(iv) isolating those SGE vectors that are lethal to the target cell. 

In preferred embodiments, the method is carried out using target and subtractive cells that are 
eukaiyotic cells, more preferably mammalian cells. In certain embodiments*, one or both of the 
20 target and subtractive cells are human cells. 

In certain embodiments, the target is a transformed cell, and subtractive cell is an 
untransformed cell. 

In other embodiments, the target is a cell infected with a virus, and subtractive cell is 
an uninfected cell. 

25 In preferred embodiments, the expression vectors are viral vectors, and more preferably 

retroviral vectors. 

The SGE library can be generated from a normalized cDNA library, a subtractive 
cDNA library, or a combination thereof. 
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By random incorporation, or duectiona, cloning, lb. library can be generated ,0 include 
SGE having a sense oriented sequence encoding a peptide, and/or an anusense-onented 
sequence encoding an antisense RNA. 

Another aspect of the invention reiates ,0 formulations of the SOEs identifed 
5 according to the subject method. For example, the present tnvention provides a synthetic 
oligonucieotide having a nuclide sequence from about 12 nuclides to aH of he 
nuc.eo.ide sequence of an antisense RNA encoded by an SGE .den.ified 
subject me.hod. In o.her embodiments, me inven.io„ provides an iso„ed peptide encoded by 
an SGE identified according to the subject method, or a peptidomimetic thereof. 
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fir^fnAcrri ptioii of the Figures 

Fl gures 1-8 are flowcharts illustrating exemplary embodiments of the subject assay. 

T), I Mir d Irr-rrir^" " f ' hp deferred Embodiments 

15 i. Overview 

In one aspect, me present invention facilia.es drug discovery by providing a method for 
,den«ifying nuc.cic acid sequences ma. confer a desired phenotype in a cell-type selective 
mamter. The strategy rehes, in pa*, on the ability of small gene fragmen* to encode 
dominant-acting synthetic genetic elements (SGEs), e.g., m„,ecu,es .ha, interfere 
20 function of genes from which .hey are derived (an.ago„is.s) or that are dominant c—ly 
active fragments (agonists) of such genes. SGEs that can be identified by the subjec method 
mclude, but are no. limi.ed to, polypeptides, inhibitory antisense RNA molecules, rhymes, 
nucleic acid decoys, and small peptides. 

In addition to being of d.rec, po.en.ia! .herapeutic value, e.g., as a pharmaceutical, an 
25 SGE identified by the present invention identifies an endogenous gene that is a,so of potential 
d,agnos,ic and other therapeu.ic value. For ins.ance, a gene whose activity is inactiva.ed by an 
identified SGE can ..self be used as a <arge, for drug development e.g., .o identify outer 
agents, such as smaH molecu.es and na,„ra, ex.rac B , which can also inhib,. me function of me 
endogenous gene. Thus, anouter aspec. of ,he presen, invention provides drug screenmg 
30 assays for de,ecti„g agonis* or an.a g onis.s, as appropriate, of a gene (or gene prc^c, .hereof) 
tha. corresponds .o a selected SGE. Likewise, the identification of an SGE .ha, can mh,b,. a 
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particular pathological phenotype will indicate diagnostic assays that can assess loss-of- 
function or gain-of-function mutations, as appropriate, to the corresponding endogenous gene. 

In general, the subject method utilizes one or more "subtractive" cell lines, and a 
"target" cell line. As used herein, a "desired phenotype" refers to a particular phenotype for 
5 that the user of the subject method seeks to have selectively conferred on the target cell line 
upon expression of an SGE. That is, the desired phenotype is dependent upon expression of an 
SGE, and is selectivity conferred, upon expression of the SGE, in the target cell line but not 
the subtractive cell line(s). The subtractive cell lines are used to remove SGEs from a 
variegated SGE library on the basis that these SGEs interferee with the detection of the desired 
10 pehnltype in the target cells, e.g., SGEs are removed for not being selective for the target cell 
line with respect to conferring the desired phenotype. 

Figure 1 illustrates a general approach of the method. The subject method can 
comprise a first step of providing a variegated SGE library, e.g., from the target cells. Those 
SGEs of the library which do not interfere with the detection of the desired phenotype in the 
15 target cells, are rescued from the subtractive cells. This sub-library, e.g., the "subtracted 
library", is then transfected into the targe", cell line. SGEs that confer the desired phenotype on 
the target cells can be identified, e.g., isolated and sequenced. At this point, the "subtracted 
SGE library" is comprised of two classes of SGE's. The first class includes DNA fragments 
that either do not have SGE activity (do not inhibit/interfere with gene function) or the SGE 
20 inhibits the activity of genes whose inhibition have no relevant phenotype in the screen. We 
expect that the majority of the library belong to this class. The second class contains 
functional SGEs that interfere with the function of genes whose inhibition confer the desired 
phenotype. We expect that a small fraction of the enriched SGE library will belong to this 
class. To enrich the second group of the SGEs we utilize a repeated transfection -> selection 
25 -> amplification cycle. The result of this cycle is that SGEs with the desired phenotype are 
amplified while SGEs with no relevant phenotypes are lost from the library. The transfection - 
-> selection -> amplification cycle continues until substantially all of the transfected cells 
displays the desired phenotype. These SGEs, by virtue of the enrichment step, will selectively 
confer the desired phenotype on the target cell population, selectively at least with respect to 
30 the various subtractive cell lines used in the enrichment step. 

For instance, in one embodiment, the present invention provides a selection method 
that allows fast recovery and identification of functional gene fragments that selectively inhibit 
growth of, eg., are cytostatic or cytotoxic for, particular cell-types - such as transformed cells. 

-4- 
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An advance of the subjec. tnemod is «* « permits pheno«ypic section of MM SGEs 
^ dominant inac,iva,ion of genes critica, to growth in one cel. W e bu, not another. For 
example, the subject method can be used to identify SGEs that are ,e,ha. to nmror cells, bu, 
which do no, substantially interfere with the growth or viability of normal cells. 

5 Figure 2 illustrates an exemplary mode for performing the method of ,h= presen, 

invenhon ,o identify SGEs ma, inhibit growth of transformed cells, e.g. tumor cells. An SGE 
hVary is provided in the form of a libra* of expression vectors, i.e., ,ha, is variegate wuh 
rK pec, to the sequence of the SGE provided ,n the individual vectors of the libra*, m he 
Hlusua.ed scheme, ,he SGE library is general from nudeic acid isola,ed from tumor ee ls 

,0 The SGE horary is .ransfected mto subtract™ cells, e.g., such as the uniformed cell that 
correspond ,o the tumor cel. from which the SGE library ,s generated. To illustrate, the SGE 
hbrary can be generated from cDNA isolated from a basa, cell carcinoma, and sub,,ac.,ve 
cells can be normal keratinocytes. 

The SGE expression vectors are rescued from the those subtracttve cells that are viable 

15 anersevera.divis.onsundercondi.ionswhereinmeSGEsareexpressed.Therationaieofdus 

enrichment step is based a. leas, in pan are the undercunding mat a number of me SGE 
sciences w,H inhibit housekeeping genes, an even, tha, would be cytotoxic to both norma, 
and .reformed ecus. Cells tha. have been .ra»sfec,eu with SGEs tha, are leth, to norma, 
cells wi„ d,e, and those SGE sequences wi„ bo ,os, from me hbrary. The rescued porhon of 
20 me library will be enriched for SGE sequences that arc not letha, to the subtle ce„. 

The sub.rac.ed SGE library is .hen transfected into the targe, tumor ce„s. Transfected 
, um or ce„s .ha. undergo apop.osis are iso,a.ed, and the SGE expression constructs are rescued 
ta, those cell, The rescued SGE sequences represent .hose ma, are no, lethal » me 
sub.nc.ive cells, bu, ,ha, are eventually lethal ,o the targe, cell. The lethal SGE sequences 
25 may be recyCed through one or bom of me e^ichmen. s.eps, e.g„ in order «o further ennch for 
specific and highly penetrant SGEs. 

,„ genera,, mis approach is useful to identify SGEs, and their corresponding urge, 
genes, whose ac.ivi,y is essen.ia, for me surv,va, of diseased cells and/or for me maintenance 
of ,he diseased phenotype. The inhibit of me -disease specific genes" in d,seased cells 

irreversible effect on the physiology of normal cells. 
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The SGEs and target genes identified by this method can be used for the selective 
treatment of the disease or as a means for discovery of useful drug therapies. The encoded 
proteins can be used in drug screening assays to identify small molecular weight drugs that 
inhibit the function of the target gene by interfering with the function of the encoded protein. 
5 The therapeutic window of these drugs is likely to be broad since the inhibiton of the gene has 
no effect on normal cells but lethal to the diseased cells, as demonstrated by the selectivity of 
the SGEs used to identify these genes. 

The general scheme to find "disease specific genes" is described below and illustrated 
in Figure 3. The SGE library is made from diseased cells. First, the SGE library is transfected 

1 0 into the corresponding normal cells. After several cycles of cell division, cells that have been 
infected with SGEs that are lethal for the normal cells, will die and will be lost from the 
library. This subtracted library will be recovered from normal cells and will be used to 
transfect the diseased cells. SGEs will be rescued from dying diseased cells, amplified and re- 
transfected into the diseased cells. This cycle will be continued until substantially all of the 

15 diseased cells are dying after the transfection of the enriched SGE library. At this point, 
individual SGEs will be rescued and their redundancy in the enriched SGE library will be 
determined by hybridization. After sequencing the SGEs, their endogenous gene target will be 
determined. The specificity of the GSEs can further be investigated after transfection into a 
panel of normal and other diseased cells. 

20 In other embodiments, the assay can be derived to identify SGEs that selectively inhibit 

the proliferation of virally-infected cells, e.g., but not normal cells. For example, the SGE 
library can be enriched for SGEs that do not inhibit proliferation or viability of normal T 
lymphocytes, but that are cytotoxic or cytostatic for HIV-infected T lymphocytes. Such SGEs 
may be useful for human therapeutic applications, such as antiviral therapy. The endogenous 

25 gene corresponding to the selectively lethal SGEs can also be, as described below, the target of 
a drug screening assay for identifying other agents that, by mimicing the SGE, have selective 
antiviral activity. 

In similar fashion, the subject method can be used to identify SGEs that inhibit 
replication of pathogenic viruses in particular cell-types, and then to generate genetically 
30 modified cells using the SGE. Genetically modified cells according to the invention can 
provide benefits, such as cell-specific virus resistance, which can be commercially important 
in biotechnology processes using living cells, as well as in food crops derived from virus- 
resistant cells, or even in agriculturally important transgenic animals. 

-6- 



PCT/US00/06385 

WO 00/53743 . 

In another — imP-ed agricultural ^ can » *~ 
modif,ca,ion by identification of SGEs that suppress genes response for u„des,r 
!!1 e.g to P-icide, —nation of inbred plants, or o her 

^ * isnWcar, * Thus,,he SCEs - be used ,o create ,ransgen,c plants ac^ 
5 rldeslb e trai,s, or to deve,op s m a„ molecules or other a g e„,s ,ha, mnrnc me ab„„y 
TsGE ,o confer such ,rai,s (e.g., by modulating me ac,ivi,y of me correspond,^ 
endogenous gene or gene product). 

Hgure 4 illustrates a scheme to isolate SOEs which which supress genes required for 
pesticide esistance in p>an,s. For Ms application the SGE library is made from peshc, e 

distant ce,,s i„ the absence of pes,iodes (subtractive cells) ,0 — nonna, ^SGE 
The infec.eo survivor cells are incubated with the pesticide (m th,s case no «d for to «. 

he subtracted libra.) and death cells are collected. The SOEs are rescued, amp , c, , 
in fec,ed and rescued aga,„ front Ote death cehs u„t„ -100% of the mfeCed cells d, 

resistance in plants. 

ln sttH other enrhodimen., the subject m«hod can be used ,0 identify SGEs, and 

conseouently endogenous genes, which can modulate cell differentiation. For example th 

Hen, tep can uti.ize, as subtract ceUs, various progenitor cells m d/or 

* 2 Hse to cln cell-type, SGEs Otat do not interfere w„h *ose processes are enr. he^ 

L «. sub-libra, is transfected ,n,o a targe, eel, under conditions wherem the ,arge cel, 

irirdtaamy differentiate into some more mature phenotype. SGEs ma, 

cond . t . ons)> bu( no , ^ subtracllve 

differentiation process in the target ceinor „ amD i. 
cells (or subtractive conditions,, can be iden,if,ed by selecnon ,„ thrs step. Thus, for xampfe 
25 SGEscan be identic which perm,, differentiation of embryonic stem cel,s, such as 

25 m « a,. o« ^ — - — - <i h 

nepa.ocytic lineages, bu, which inhibi, different ,o pancreauc, lun & or o^ , ssu 
d Ived from the pnmihve gu, Likewise, the subject method can be used to ,den , y SGE 
1 inhibi, different of neural crest s,em cells ,o a neuronal pheno.ype bu, no o 
30 ^h lia, or smooth muscle. As above, SGEs mat effect stem ce„ differe„,,a,,o„ m a selec,, 
Ilr mdicate me identity of endogenous genes tha, can themselves be the urge, of drug 
screening to develop sma.1 molecule agonists or antagonis, of , heir tacon. 
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In other embodiments, the subject method can be used to identify SGEs that inhibit a 
cellular response to an extracellular signal, such as a growth factor, cytokine, or other 
paracrine or autocrine factor. For example, TGFa is an important positive growth effector in 
malignant cells and plays a significant role in malignant progression. In one embodiment, the 
5 subject method first enriches for SGEs that do not inhibit growth of TGFa -responsive cells in 
the absence of TGFa, or enriches for SGEs that do not inhibit growth of cells which 
proliferate in a TGFa -independent manner. The subtracted SGE library is then transfected 
into TGFa-responsive cells, preferably transformed cells, in the presence of TGFa, and SGEs 
that inhibit growth under these conditions are identified. Similar assay formats can be used to 
10 identify SGEs that modify responsiveness of normal or transformed cells to other growth 
factors, cytokines, tropic factors or the like. 

The subject method can also be used to identify SGEs that confer sensitivity to drugs. 
For example, the method can be used to generate SGEs that confer sensitivity to 
chemotherapeutic agents, which can lead to both diagnostic and therapeutic approaches for 
1 5 drug resistant cancer cells. In other embodiments, the subject method is used to identify SGEs 
that confer sensitivity to drugs to a pathogen, such as a drug-resistant bacteria or fungus. 

Figure 5 depicts an approach to find SGEs which confer sensitivity to 
chemotherapeutic agents in cells that are normally resistant to this drug. In this example, the 
SGE library is constructed from drug resistant cancer cells. The variegated SGE library is 

20 introduced into the this cells in the absence of the drug (subtractor cells) to eliminate normal 
lethal SGEs. Next, the survivor resistant cells are grown in the presence of the drug (target 
cells) and death cells are collected. The SGEs are rescued, re-infected and rescued again from 
death cells until -100% of the infected cells die in the presence of the chemotherapeutic agent. 
These SGEs identify genes whose function is required to develop resistance against a 

25 chemotherapeutic agent. 

The same approach can be applied to find SGEs that restore the sensitivity of drug 
resistant microorganisms to drugs. In this example, the initial SGE library is constructed form 
the drug resistant strain in an appropriate expression vector system. Than, the variagated SGE 
library is introduced into drug resistant cells and non-specific, normal lethal SGEs are 

30 eliminated from the library by growing the cells in the absence of the drug (subtractor cells). 
Next, survivor cells are grown in the presence of the drug and death cells are collected. The 
same steps are repeated until -100% of the drug resistant cells transfected with the enriched 



-8- 
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0 f the drug These SGEs will confer sensitivity because they 
SGE library die in the presence of the drug. m» 

u wtion is essential to develop resistance to the drug, 
interfere with genes whose function is essentia 

u rt of the present invention provides nucleic acids and/or peptides that 

administered to an animal in order to inhibit gro 
effectiveness of a conjointly administered antiproliferative agent. 

• k Himents where the identified SGE encodes a peptide or protein, the 

In certain embodiments, wnere uw 

f QGE is used to generate an expression constructs. In 

- ir-: : ^ * — i;:::i:ri:i: 

15 nuckio acids can be provided as non-hydrdyxabie analogs. 

, „f the present invention provides a dmg screening assay for 

product identified as the source of a selective 
agricultural formulations. 



zf. De finitions 

F or convenience, certain terms emp.oyed in the specif.ca.ion, examples, and appended 

25 claims are collected here. 

• .„ ,„™ "vector" refers to a nucleic acid molecule capable of 
As M ed heretn the «» - ^ ^ ^ ^ a ^ 

— * ^ ^ 2 'which can become intend into the chromsoma, 
of the host eel, An of djrecling , he exprKsion of genes to that 

30 rr::z .r r-erred „u . — — • - - — 
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specification, "plasmid" and "vector" are used interchangeably unless otherwise clear from the 
context. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term 
5 should also be understood to include, as applicable to the embodiment being described, single- 
stranded (such as sense or antisense) and double-stranded polynucleotides. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
comprising an open reading frame encoding a polypeptide of the present invention, including 
both exon and (optionally) intron sequences. A "recombinant gene" refers to nucleic acid 
10 encoding such regulatory polypeptides, that may optionally include intron sequences that are 
derived from chromosomal DNA. The term "intron" refers to a DNA sequence present in a 
<*iven gene that is not translated into protein and is generally found between exons. As used 
herein, the term "transfection" means the introduction of a nucleic acid, e.g., an expression 
vector, into a recipient cell by nucleic acid-mediated gene transfer. 

15 A "protein coding sequence" or a sequence that "encodes" a particular polypeptide or 

peptide, is a nucleic acid sequence that is transcribed (in the case of DNA) and is translated (in 
the case of mRNA) into a polypeptide in vitro or in vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are determined by a 
start codon at the 5* (amino) terminus and a translation stop codon at the 3' (carboxy) terminus. 

20 A coding sequence can include, but is not limited to, cDNA from procaryotic or eukaryotic 
mRNA, genomic DNA sequences from procaryotic or eukaryotic DNA, and even synthetic 
DNA sequences. A transcription termination sequence will usually be located 3* to the coding 
sequence. 

However, as described below, the generic term "coding sequence" may refer to, as the 
25 context permits, sequences that are transcribed to produce RNA that is itself directly active (as 
a potential SGE), as opposed to a polypeptide translated therefrom. 

Likewise, "encodes", unless evident from its context, will be meant to include DNA 
sequences that encode a polypeptide, as the term is typically used, as well as DNA sequences 
that are transcribed into inhibitory antisense molecules. 

30 An "initial SGE library" is a library of coding sequences for potential synthetic genetic 

elements. 

A "subtracted SGE library"is the sub-library of the original SGE library that was 

-10- 
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generated by the infection of the library into subtractor cells to remove SGE's whose activity 
would interfere with the screening of the SGE library in the target cells. 

An "enriched SGE library" is a collection of SGE's that confers the desired phenotype 
in target cells. The enriched library is generated by repeated transfection — > selection — > 
5 amplification of the subtracted library in target cells. 

"Subtractive cells" are those cells that are used to remove those SGEs from a 
variegated SGE library that confer a non-desirable phenotype. 

"Target cells" are cells are used to identify SGEs from an SGE library that confer the 
desired phenotype on the target cells. 
10 A "normal lethal SGE" is a synthetic genetic element that is lethal to one or more 

subtractive cell-types. 

A "selective lethal SGE" is a synthetic genetic element that is lethal to a target cell- 
types, but not to a subtractive cell-type. 

The term "loss-of-function", as it refers to SGEs, refers to those SGEs that inhibit 
15 expression of a gene, or render the gene product thereof to have substantially reduced activity, 
or preferably no activity relative to one or more functions of the corresponding wild-type gene 
product. 

The term "expression" with respect to a gene sequence refers to transcription of the 
gene and, as appropriate, translation of the resulting mRNA transcript to a protein. Thus, as 
20 will be clear from the context, expression of a protein coding sequence results from 
transcription and translation of the coding sequence. On the other hand, "expression" of an 
antisense sequence or ribozyme will be understood to refer to the transcription of the 
recombinant gene sequence as it is the RNA product that is directly active. 

"Cells," "host cells" or "recombinant host cells" are terms used interchangeably herein. 
25 It is understood that such terms refer not only to the particular subject cell but to the progeny 
or potential progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in fact, 
be identical to the parent cell, but are still included within the scope of the term as used herein. 

By "recombinant virus" is meant a virus that has been genetically altered, e.g., by the 
30 addition or insertion of a heterologous nucleic acid construct into the particle. 
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As used herein, the terms "transduction" and "transfection" are art recognized and 
mean the introduction of a nucleic acid, e.g., an expression vector, into a recipient cell by 
nucleic acid-mediated gene transfer. "Transformation", as used herein, refers to a process in 
which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or 
5 RNA, and, for example, the transformed cell expresses a recombinant form of a polypeptide 
or, where anti-sense expression occurs from the transferred gene, the expression of a naturally- 
occurring form of a protein is disrupted. 

"Transient transfection" refers to cases where exogenous DNA does not integrate into 
the genome of a transfected cell, e.g., where episomal DNA is transcribed into mRNA and 
1 0 translated into protein. 

A cell has been "stably transfected" with a nucleic acid construct when the nucleic acid 
construct is capable of being inherited by daughter cells. 

As used herein, a "reporter gene construct" is a nucleic acid that includes a "reporter 
gene" operatively linked to at least one transcriptional regulatory sequence. Transcription of 
15 the reporter gene is controlled by these sequences to which they are linked. The activity of at 
least one or more of these control sequences can be directly or indirectly regulated by the 
target receptor protein. Exemplary transcriptional control sequences are promoter sequences. 
A reporter gene is meant to include a promoter-reporter gene construct that is heterologously 
expressed in a cell. 

20 As used herein, "transformed cells" refers to cells that have spontaneously converted to 

a state of unrestrained growth, i.e., they have acquired the ability to grow through an indefinite 
number of divisions in culture. Transformed cells may be characterized by such terms as 
neoplastic, anaplastic and/or hyperplastic, with respect to their loss of growth control. For 
purposes of this invention, the terms "transformed phenotype of malignant mammalian cells" 

25 and "transformed phenotype " are intended to encompass, but not be limited to, any of the 
following phenotypic traits associated with cellular transformation of mammalian cells: 
immortalization, morphological or growth transformation, and tumorigenicity, as detected by 
prolonged growth in cell culture, growth in semi-solid media, or tumorigenic growth in 
immuno-incompetent or syngeneic animals. 

30 

As used herein, "proliferating" and "proliferation" refer to cells undergoing mitosis. 



- 12- 



WO 00/53743 PCT/USOO/06385 

As used herein, "immortalized cells" refers to cells that have been altered via chemical, 
genetic, and/or recombinant means such that the cells have the ability to grow through an 
indefinite number of divisions in culture. 

The "growth state" of a cell refers to the rate of proliferation of the cell and the state of 
differentiation of the cell. 



HI Exemplary embodiments of Isolati on Method 

The SGEs identified by the present method may function to inhibit the function of an 
endogenous gene at the level of nucleic acids, e.g., by an antisense or decoy mechanism, or by 
encoding a polypeptide that is inhibitory through a mechanism of interference at the protein 
level, e.g., a dominant negative fragment of the native protein. On the other hand, certain 
SGEs may function to potentiate (including mimicing) the function of an endogenous gene by 
encoding a polypeptide which retains at least a portion of the bioactivity of the corresponding 
endogenous gene, and may in particular instances be constitutively active.. 

In one embodiment, the initial SGE library is generated from total cDNA, that may be 
further fragmented, and provided in the form of an expression library. Preferably, the inserts 
in the library will range from about 100 bp to about 700 bp and more preferably, from about 
200 bp to about 500 bp in size. 

For cDNA-derived libraries, the nucleic acid library can be a normalized library 
containing roughly equal numbers of clones corresponding to each gene expressed in the cell 
type from which it was made, without regard for the level of expression of any gene. 

The initial SGE libraries can be generated to include both sense and antisense coding 
(and non-coding sequences) sequences. Transcription of the SGE sequence in the subtractive 
and target cells will create antisense RNA that may inhibit transcription of the corresponding 
endogenous gene. Translation of appropriate protein coding sequences in the transcribed RNA 
can produce full-length and truncated forms of endogenous proteins, as well as short peptides, 
the differential biological effects of that are assessed in the subtractive and target cells. 

US Patent 5,702,898 describes a method to normalize a cDNA library constructed in a 
vector capable of being converted to single-stranded circles and capable of producing 
complementary nucleic acid molecules to the single-stranded circles comprising: (a) 
converting the cDNA library in single-stranded circles; (b) generating complementary nucleic 
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acid molecules to the single-stranded circles; (c) hybridizing the single-stranded circles 
converted in step (a) with complementary nucleic acid molecules of step (b) to produce partial 
duplexes to an appropriate Cot; (e) separating the unhybridized single-stranded circles from 
the hybridized single-stranded circles, thereby generating a normalized cDNA library. 

5 In certain embodiments, the initial SGE library can be a subtractive cDNA library. 

Many strategies have been used to create subtractive libraries, and can be readily adapted for 
use in the present method. One approach is based on the use of directionally cloned cDNA 
libraries as starting material (Palazzolo and Meyerowitz, (1987) Gene 52:197; Palazzolo et al. 
(1989) Neuron 3:527; Palazzolo et al. (1990) Gene 88:25). In this approach, cDNAs prepared 
10 from a first source tissue or cell line are directionally inserted immediately downstream of a 
bacteriophage T7 promoter in the vector. Total library DNA is prepared and transcribed in 
vitro with T7 RNA polymerase to produce large amounts of RNA that correspond to the 
original mRNA from the first source tissue. Sequences present in both the source tissue and 
another tissue or cells, such as normal tissue, are subtracted as follows. The in vitro transcribed 

15 RNA prepared from the first source is allowed to hybridize with cDNA prepared from either 
native mRNA or library RNA from the second source tissue. The complementarity of the 
cDNA to the RNA makes it possible to remove common sequences as they anneal to each 
other, allowing the subsequent isolation of unhybridized, presumably tissue-specific, cDNA. 
This approach is only possible using directional cDNA libraries, since any cDNA sequence in 

20 a non-directional library is as likely to be in the "sense" orientation as the "antisense" direction 
(sense and antisense are complementary to each other). A cDNA sequence unique to a tissue 
would be completely removed during the hybridization procedure if both sense and antisense 
copies were present. 

In one directional cloning strategy, which can be used to generate an initial SGE 
25 library, a DNA sequence encoding a specific restriction endonuclease recognition site (usually 
6-10 bases) is provided at the 5' end of an oligo(dT) primer. This relatively short recognition 
sequence does not affect the annealing of the 12-20 base oligo(dT) primer to the mRNA, so the 
cDNA second strand synthesized from the first strand template includes the new recognition 
site added to the original 3' end of the coding sequence. After second strand cDNA synthesis, a 
30 blunt ended linker molecule containing a second restriction site (or a partially double stranded 
linker adapter containing a protruding end compatible with a second restriction site) is ligated 
to both ends of the cDNA. The site encoded by the linker is now on both ends of the cDNA 
molecule, but only the 3' end of the cDNA has the site introduced by the modified primer. 
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Following the linker ligation step, the product is digested with both restriction enzymes (or, if 
a partially double stranded linker adapter was ligated onto the cDNA, with only the enzyme 
that recognizes the modified primer sequence). A population of cDNA molecules results which 
all have one defined sequence on their 5' end and a different defined sequence on their 3' end. 

5 A related directional cloning strategy developed by Meissner et al. (1987) PNAS 

84:4171), requires no sequence-specific modified primer. Meissner et al. describe a double 
stranded palindromic BamHI/Hindlll directional linker having the sequence 
d(GCTTGGATCCAAGC), that is ligated to a population of oligo(dT)-primed cDNAs, 
followed by digestion of the ligation products with BamHI and Hindlll. This palindromic 

10 linker, when annealed to double stranded form, includes an internal BamHI site (GGATCC) 
flanked by 4 of the 6 bases that define a Hindlll site (AAGCTT). The missing bases needed to 
complete a Hindlll site are d(AA) on the 5' end or d(TT) on the 3' end. Regardless of the 
sequence to which this directional linker ligates, the internal BamHI site will be present. 
However, Hindlll can only cut the linker if it ligates next to an d(AA):d(TT) dinucleotide base 

15 pair. In an oligo(dT)-primed strategy, a Hindlll site is always generated at the 3' end of the 
cDNA after ligation to this directional linker. For cDNAs having the sequence d(TT) at their 5' 
ends (statistically 1 in 16 molecules), linker addition will also yield a Hindlll site at the 5' end. 
However, because the 5" ends of cDNA are heterogeneous due to the lack of processivity of 
reverse transcriptases, cDNA products from every gene segment will be represented in the 

20 library. 

In other embodiments, the SGE library is generated from genomic DNA fragments. 
Preferably, the inserts in the library will range from about 100 bp to about 700 bp and more 
preferably, from about 200 bp to about 500 bp in size. Such SGE libraries, in addition to 
encoding polypeptide and antisense molecules that may be functional SGEs in the test method, 
25 may also "encode" decoy molecules, e.g., nucleic acid sequences which correspond to 
regulatory elements of a gene and which can inhibit expression of the gene by sequestering, 
e.g., transcriptional factors, and thereby competing for the necessary components to express 
the endogenous gene. 

In yet another embodiment, the SGE library is generated by randomly fragmenting a 
30 single gene to obtain a random fragment expression library derived exclusively from the gene 
of interest. As a practical matter, such a library will contain a much greater variety of SGEs 
derived from the gene of interest than will a random fragment library prepared from total 
cDNA. Consequently, the likelihood of obtaining optimized SGEs, that have a differential 
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activity according to the present method, from the single gene random fragment library is 
much higher. 

In one embodiment, purified DNA corresponding to the gene or genome to be 
suppressed is first randomly fragmented by enzymatic, chemical, or physical procedures. In a 
5 preferred embodiment, random fragments of DNA are produced by treating the DNA with a 
nuclease, such as DNase L The random DNA fragments are incorporated as inserts in a SGE 
library. For general principles of DNase I partial digestion and library construction see 
Molecular Cloning, A Laboratory Manual, Sambrook et al., Eds., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1989). In certain embodiments the inserted fragment 

1 0 may be expressed as part of a fusion protein. In other embodiments the inserted fragment alone 
may be expressed. In another embodiment, ribozyme-encoding sequences may be inserted 
directly adjacent to the insert to allow for selection of most efficient ribozyme-antisense 
clones. In still other embodiments the gene suppression element library may be further 
modified by random mutagenesis procedures known in the art. The inserted fragments may be 

15 expressed from either a constitutive or an inducible promoter. 

In still another embodiment, the subject method is carried out with a SGE library 
encoding a variegated population of small peptides, e.g., 4-25 amino acid residues in length. 
The library can be generated from coding sequences of total cDNA, or single genes, or can be 
random or semi-random in sequence. Small peptide fragments, corresponding to only a 
20 minute portion of a protein, can inhibit the function of that protein in vivo. 

There are a wide range of expression constructs which can be used to express the 
subject SGEs in the present method. In preferred embodiments, the SGE library is provided in 
a vector which can be transfected, and the library expressed, in both the target and subtractive 
cells. Moreover, it will generally be desirable that the vector be recoverable from the cells. 

25 The SGEs can be recombinantly expressed using, e.g, expression vectors containing 

the SGE coding sequence operably linked to at least one transcriptional regulatory sequence. 
Operably linked is intended to mean that the nucleotide sequence is linked to a regulatory 
sequence in a manner which allows expression of the nucleotide sequence. Regulatory 
sequences are art-recognized and are selected to direct expression of the SGE sequence in the 

30 subtractive and target cells. Accordingly, the term transcriptional regulatory sequence 
includes promoters, enhancers and other expression control elements. Such regulatory 
sequences are described in Goeddel; Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, CA (1990). For instance, any of a wide variety of 
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expression control sequences may be used in the subject expression vectors, such as a viral 
LTR (e.g., the LTR of the Moloney murine leukemia virus), the early and late promoters of 
SV40, adenovirus or cytomegalovirus (CMV) immediate early promoter, the promoter for 3- 
phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., 
5 Pho5, the promoters of the yeast a-mating factors, the polyhedron promoter of the baculovirus 
system and other sequences known to control the expression of genes in prokaryotic or 
eukaryotic cells or their viruses, and various combinations thereof. It should be understood 
that the design of the expression vector may depend on such factors as the choice of the host 
cell to be transfected and/or the type of protein desired to be expressed. Moreover, the vector's 
1 0 copy number, the ability to control that copy number and the expression of any other proteins 
encoded by the vector, such as antibiotic markers, should also be considered. 

Approaches to expressing the SGE library in subtractive and target cells include 
insertion of the SGE coding sequence in viral vectors including recombinant retroviruses, 
adenovirus, adeno-associated virus, and herpes simplex virus-1, or recombinant bacterial or 
15 eukaryotic plasmids. Viral vectors transfect cells directly; plasmid DNA can be delivered 
with the help of, for example, cationic liposomes (lipofectin), polylysine conjugates, 
gramacidin S, artificial viral envelopes or other such intracellular carriers, as well as direct 
injection of the gene construct or calcium phosphate precipitation carried out in vivo. It will be 
appreciated that because the efficiency of transduction of the target and subtractive cells can 
20 represent a critical step in sampling the SGE library, choice of the particular transfection 
system will depend on such factors as the phenotype of the intended target. 

A preferred approach for introduction of the SGE library into a subtractive or target 
cell is by use of a viral vector. Infection of cells with a viral vector has the advantage that a 
large proportion of the targeted cells can receive the nucleic acid. Additionally, the SGE 
25 coding sequences contained in the viral vector can be expressed efficiently in cells that have 
taken up viral vector nucleic acid. 

Retrovirus vectors are a prefered recombinant delivery system for the transfer of the 
SGE constructs into mammalian cells, particularly into humans. These vectors provide 
efficient delivery of genes into cells, and the transferred nucleic acids are stably integrated into 
30 the chromosomal DNA of the host. 

In preferred uses of retroviruses, e.g., to ensure the safety of their use, the vectors will 
used in conjunction with specialized cell lines (termed "packaging cells") which produce only 
replication-defective retroviruses (for a review see Miller, A.D. (1990) Blood 76:271). Thus, 
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recombinant retrovirus can be constructed in which part of the retroviral coding sequence (gag, 
pol, env) has been replaced. The replication defective retrovirus is then packaged into virions 
which can be used to infect a target cell through the use of a helper virus by standard 
techniques. Protocols for producing recombinant retroviruses and for infecting cells in vitro 
5 with such viruses can be found in Current Protocols in Molecular Biology Ausubel, F.M. et al. 
(eds.) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard laboratory 
manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE and pEM that are well 
known to those skilled in the art. Examples of suitable packaging virus lines for preparing 
both ecotropic and amphotropic retroviral systems include Crip, Cre, 2 and Am. Retroviruses 
0 have been used to introduce a variety of genes into many different cell types in vitro (see for 
example Eglitis, et al. (1985) Science 230:1395-1398; Danos and Mulligan (1988) Proc. Natl. 
Acad. Sci. USA 85:6460-6464; Wilson et al. (1988) Proc. Natl. Acad. Sci. USA 85:3014-3018; 
Armentano et al. (1990) Proc. Natl. Acad. Sci. USA 87:6141-6145; Huber et al. (1991) Proc. 
Natl. Acad. Sci. USA 88:8039-8043; Ferry et al. (1991) Proc. Natl. Acad. Sci. USA 88:8377- 
5 8381; Chowdhury et al. (1991) Science 254:1802-1805; van Beusechem et al. (1992) Proc. 
Natl. Acad. Sci. USA 89:7640-7644; Kay et al. (1992) Human Gene Therapy 3:641-647; Dai et 
al. (1992) Proc. Natl. Acad. Sci. USA 89:10892-10895; Hwu et al. (1993) J. Immunol. 
150:4104-4115; U.S. Patent No. 4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 
89/07136; PCT Application WO 89/02468; PCT Application WO 89/05345; and PCT 
0 Application WO 92/07573). 

In certain preferred embodiments, the SGE library is cloned into a retroviral expression 
vector to create a variegated vector library. Exemplary vectors for use in the present method 
are described in, for example, US Patents 5,753,432, 5,665,550, 5,206,352, and PCT 
publications WO 98/12339. 

The SGE library is used to transfect the subtractive cells, e.g., by introducing the 
library into the cells by procedures appropriate to the vector chosen and well known in the art. 
See, e.g., Keown et al., Methods Enzymol. 185:527-536 (1990). The genetically modified 
subtractive cells containing SGEs can be screened for or selected in a variety of ways. 

The transfected subtractive cells are cultured for a period of time sufficient for the cells 
to at least develop the desired phenotype, or one which interferes with its detect, in order to 
permits selection of cells expressing SGEs which do not exhibit such phenotypes. 

For example, where the desired phenotype is lethality, cells that are viable after 
expression of the SGE library can be amplified in the culture and isolated after a sufficient 
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number of passages for the normal lethal SGEs to kill their host cells. The viable cells can be 
isolated from the culture supernatant, e.g., by centrifugation, and the SGE vectors isolated 
from the intact cells. However, it may be desirable to separate the cells before death and lysis 
of the cells expressing lethal SGEs occurs. In such embodiments, cells can be differentially 
5 sorted, e.g., by fluorescence activated cell sorters (FACs), affinity purification techniques or 
other well known markers. For example, the ability of viable cells to incorporate BrdU can 
provide for a FACs-selectable marker. 

Continuing with the illustrative example of selective lethality, in certain embodiments 
it may be necessary to stimulate the subtractive cells with a mitogen(s), or by repassage in 
10 order to cause proliferation. In addition to providing a proliferative signal against which the 
anti-proliferative abilities of the SGE library are selected, such stimulation can give rise to 
changes in cell markers, such as cell surface proteins, which can be detected by FACs, or 
changes in expressions of a reporter gene. 

In certain embodiments, a heterologous' reporter gene construct can be used to provide 
1 5 the function of an indicator gene. Reporter gene constructs are prepared by operatively linking 
a reporter gene with at least one transcriptional regulatory element which is, e.g., activated by 
mitogenic signals. Many reporter genes and transcriptional regulatory elements are known to 
those of skill in the art and others may be identified or synthesized by methods known to those 
of skill in the art. 

20 Examples of reporter genes include, but are not limited to CAT (chloramphenicol 

acetyl transferase) (Alton and Vapnek (1979), Nature 282: 864-869) luciferase, and other 
enzyme detection systems, such as beta-galactosidase; firefly luciferase (deWet et al. (1987), 
Mol. Cell. Biol. 7:725-737); bacterial luciferase (Engebrecht and Silverman (1984), PNAS 1: 
4154-4158; Baldwin et al. (1984), Biochemistry 23: 3663-3667); alkaline phosphatase (Toh et 
25 al. (1989) Eur. J. Biochem. 182: 231-238, Hall et al. (1983) J. Mol. Appl. Gen. 2: 101), human 
placental secreted alkaline phosphatase (Cullen and Malim (1992) Methods in Enzymol. 
216:362-368); p-lactamase or GST. 

Transcriptional control elements for use in the reporter gene constructs, or for 
modifying the genomic locus of an indicator gene include, but are not limited to, promoters, 
30 enhancers, and repressor and activator binding sites. Again, using the selective lethality 
embodiments to illustrate, suitable transcriptional regulatory elements may be derived from the 
transcriptional regulatory regions of genes whose expression is rapidly induced, generally 
within minutes, of contacting the cell with a mitogenic agent. Examples of such genes include, 
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but are not limited to, the immediate early genes (see, Sheng et al. (1990) Neuron 4: 477-485), 
such as c-fos. Immediate early genes are genes that are rapidly induced upon mitogenic 
stimulation. The transcriptional control elements that are preferred for use in the gene 
constructs include transcriptional control elements from immediate early genes, elements 
5 derived from other genes that exhibit some or all of the characteristics of the immediate early 
genes, or synthetic elements that are constructed such that genes in operative linkage therewith 
exhibit such characteristics. The characteristics of preferred genes from which the 
transcriptional control elements are derived include, but are not limited to, low or undetectable 
expression in quiescent cells, rapid induction at the transcriptional level within minutes of 
10 mitogenic simulation, induction that is transient and independent of new protein synthesis, 
subsequent shut-off of transcription requires new protein synthesis, mRNAs transcribed from 
these genes have a short half-life, and lack of expression in apoptotic cells. It is not necessary 
for all of these properties to be present. 

Other promoters and transcriptional control elements, in addition to those described 
15 above, include the vasoactive intestinal peptide (VIP) gene promoter (cAMP responsive; Fink 
et al. (1988), Proc. Natl. Acad. Sci. 85:6662-6666); the somatostatin gene promoter (cAMP 
responsive; Montminy et al. (1986), Proc. Natl. Acad. Sci. 8.3:6682-6686); the proenkephalin 
promoter (responsive to cAMP, nicotinic agonists, and phorbol esters; Comb et al. (1986), 
Nature 323:353-356); the phosphoenolpyruvate carboxy-kinase gene promoter (cAMP 
20 responsive; Short et al. (1986), J. Biol. Chem. 261:9721-9726); the NGFI-A gene promoter 
(responsive to NGF, cAMP, and serum; Changelian et al. (1989). Proc. Natl. Acad. Sci. 
86:377-381); and others that may be known to or prepared by those of skill in the art. 

In the case of receptors which modulate cyclic AMP, for example, a transcriptional 
based readout can be constructed using the cyclic AMP response element binding protein, 
25 CREB, that is a transcription factor whose activity is regulated by phosphorylation at a 
particular serine (SI 33). When this serine residue is phosphorylated, CREB binds to a 
recognition sequence known as a CRE (cAMP Responsive Element) found to the 5' of 
promotors known to be responsive to elevated cAMP levels. Upon binding of phosphorylated 
CREB to a CRE, transcription from this promoter is increased. 

30 Therefore, a transcriptionally-based readout can be constructed in cells containing a 

reporter gene whose expression is driven by a basal promoter containing one or more CRE. 
Changes in the intracellular concentration of Ca~ (e.g., a result of alterations in the activity of 
the growth factor receptor upon engagement with a growth factor) will result in changes in the 
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level of expression of the reporter gene if: a) CREB is also co-expressed in the cell, and b) 
either an endogenous or heterologous CaM kinase phosphorylates CREB in response to 
increases in calcium or if an exogenously expressed CaM kinase II is present in the same cell. 
In other words, mitogenic stimulation of the cell may result in phosphorylation of CREB and 

5 increased transcription from the CRE-construct, while inhibition of PLC activity may result in 
decreased transcription from the CRE-responsive construct. 

In the exemplary embodiments of the subject method for score for SGEs which 
selectively inhibit or potentiate growth factor, reporter gene readouts as described above can 
be employed, as can direct detection of second messanger formation. 

10 Whatever the ultimate method, subtractive cells which do not develop the desired 

phenotype upon expression of an SGE are isolated, and the SGE sub-library representative of 
those cells is isolated by standard protocols. The isolated vectors are then transfected into the 
target cells - those cells for which selective SGEs are desired. 

As above, the target cells are cultured for a time sufficient for the desired phenotype to 

1 5 develop as a detectable signal. For instance, in one of the selective lethal embodiments, where 
cell lysis occurs due to apoptosis, those vectors encoding lethal SGEs can be isolated from the 
cell debris, e.g., from the culture supernatant. However, a more pragmatic approach is to 
identify the lethal phenotype before the cells lyse, and isolate the SGE vectors from a reltively 
intact, albeit dying, cell. 

20 Thus, in one embodiment, cells expressing lethal SGEs are isolated on the basis of 

expression of an early marker for apoptosis. To illustrate, the apoptotic activity of the SGE 
library can be detected using a commercially available kit, Apoptosis Detection Kit, from R & 
D SYSTEMS (Minnesota) in which annexin V, a member of the calcium and phospholipid 
binding proteins, is used to detect apoptosis, e.g., following the protocol recommended by the 

25 manufacturer. Fluorescein-labeled annexin V and propidium iodide are added to the cells. The 
cells expressing phosphatidylserine on the outer leaflet of cell membranes bind annexin V, and 
cells with a compromised cell membrane allow propidium iodide to bind to the cellular DNA. 
The resulting cells, when immediately analyzed by flow cytometry, can present three potential 
populations of cells: live cells which will not stain with either fluorochrome, necrotic cells 

30 which will stain with both fluorochrome, and cells undergoing apoptosis which will stain only 
with the annexin V-FITC reagent. Analysis, and cell separation, can be performed on 
cytometers equipped with a single laser emitting excitation light at 488 nm. Thus, SGEs which 
selectively induce apoptosis can be distinguished from those which cause necrotic cell death. 
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Another potential approach to isolate cells expressing lethal SGEs is to collect 
"floating" cells and rescue the SGEs from those cells. 

Alternatively, the cells can be loaded with small, cell permeable, fluorogenic caspase 
substrates. The activation of apoptotic caspases can be detected in vivo by the cleavage of the 
5 subtrate that increases fluorescence. Fluorescent cells can be identified and separated by 
FACS (Packard BZ et al., Proc.NatLAcad.Sci., 93:1 1640-1 1645, 1996). 

In the subject method, SGEs are finally obtained from the selected substractive or 
target cells by procedures known in the art. In one embodiment, the SGE is isolated by use of 
the polymerase chain reaction with DNA obtained from the selected cells and with primers 
10 homologous to sites on the vector flanking the insert. In another embodiment, the SGE 
expression library may be prepared in shuttle vectors, allowing efficient recovery of shuttle 
vectors containing SGEs. Finally, SGEs can be isolated by standard cloning techniques well 
known in the art using vector specific probes although this might be more laborious than other 
embodiments herein described. 

15 After identification in the target cell phase of the method, the isolated SGE inserts of 

the expression library may be sequenced. Alternatively, and preferably, the rescued library 
clone may be further tested for its ability to selectively confer the desired phenotype in other 
selection assays, prior to nucleotide sequence determination. Determination of the nucleotide 
sequence, of course, results in the identification of the SGE. 

20 A. Selective killing of HIV infected CD4 + T lvmpocvtes: 

Following the first burst of viral replication that immediately follows HIV-1 infection, 
there is a relatively long period of clinical latency (8 to 12 years). During this time the viral 
load is relatively stable which reflect to a relatively constant rate of new infection and death of 
the infected cells. The virus infected CD4+ cells dislay impaired T-cell functions (i.e. colony 

25 formation, expression of IL-2 receptor, etc.), antigen-specific responses, mitogen/antigen 
induced cell proliferation or signal transduction. The cellular changes are caused by the 
expressed viral regulatory proteins such as nef, tat, vpu, env serve the better adaptation of the 
infected cells to the needs of the virus. Since the physiology of the HIV-1 infected CD4+ T 
lymphocytes is different from the physiology of the normal cells these differences can be 

30 utilized to identify genes whose function is essential for the survival of the virus infected cells 
but not essential for the the survival of normal cells. 



-22- 



WO 00/53743 PCT/US00/06385 

For this application the SGE library is made from HIV-1 infected CD4+ T 
lymphocytes. The library is transfected into the subtractor, normal CD4+ T lymphocytes and 
the transfected cells are allowed to undergo at least two rounds of cells division. This step 
leads to the subtraction of the "normal lethal SGEs" from the library. Living cells are 
5 separated from death cells on a Percoll gradient. SGEs are rescued from the living cells and 
used to transfect the target cells, HIV-1 infected CD4+ T lymphocytes. SGEs are rescued 
from early apoptotic cells that have been isolated by one of the methods described above. This 
enriched SGE library is used to repeat the selection cycle. The selection cycles are continued 
until -100% of the transfected cell undergo apoptosis. At this point individual SGE's are 
10 isolated and used to verify the selective phenotype in normal and HIV-1 infected CD4+ T 
lymphocytes. The selectivity of the SGE is further tested on a panel of normal cells. The ideal 
SGE inhibits a cellular gene whose function is not essential in any of the normal cell lines. 
Small molecular weight inhibitors of this gene would selectively kill HIV-1 infected CD4+ 
cells by targeting a cellular and not a viral target. 

15 

B. Reeulation of cell differentiation: 

Rat PC 12 pheochromocytoma (teratocarcinoma) cells are widely used to study cell 
differentiation. In the presence of EGF (epidermal growth factor) these cells proliferate but the 
addition of NGF (nerve growth factor) induces their neuronal differentiation and they become 

20 neurons. To isolate SGEs which interfere with the NGF induced neuronal differentiation of 
PC 12 cells, the variagated SGE library is made from NGF treated PC 12 cells at various stages 
of neuronal differentiation. This SGE library is introduced into PC 12 cell proliferating in the 
presence of EGF (subtractor cell). The infected cells are allowed to undergo at least two 
rounds of cell division. This step leads to the subtraction of "normal lethal SGEs" from the 

25 SGE library. SGEs that might cause the differentiation of PC12 cells into neurons in the 
presence of EGF would also be depleted from the library using FACS or magnetic beads as 
well as antibodies against neuronal cell specific surface markers such as voltage-gated Na+, 
K+, Ca2+ channels, or glutamate receptor or other neurotransmitter receptors. At this point, 
the subtracted SGE library is rescued from the depleted cells that do not display any neuronal 

30 cell specific phenotype and is introduced into PC 12 cells. The differentiation of the subtracted 
SGE library transformed Pcl2 cells is induced by NGF. Cells displaying neuronal cell specific 
surface markers are depleted and SGEs are rescued from cells that do not display any neuronal 
phenotype. This enriched library is reintroduced into PC12 cells and the infected cells are 
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induced to differentiate into neurons. Again, SGEs are rescued from cells that do not display 
any neuronal specific phenotype. The cycle of reinfection, selection and SGE rescue is 
repeated until the NGF induced differentiation of -100% of the transfected PC12 cells is 
inhibited (Figure 6). 

5 Figure 7 illustrates another example to isolate SGEs which interfere with the neuronal 

differentiation of embryonic stem (ES) cells. Mouse ES cells can be induced to differentiate 
into neuronal and muscle cells. The differentiated neuronal cells display neuronal cell specific 
markers (see above) when ES cells are cultured in aggregates (embryoid bodies) and exposed 
to retinoic acid. To induce the neuronal differentiation of these cells, rapidly growing 

1 0 undifferentiated ES cells are gently trypsinized. The cell suspension is incubated for two days 
during which the floating aggregates are formed. The aggregates are incubated for two 
additional days and than fresh media is added with 5xl0-7M all trans retinoic acid (RA). The 
aggregates are incubated for four days in ther presence of RA (fresh media is added after two 
days). By the end of this eight days incubation period neuronal cells are formed and single 

1 5 cell suspension can be generated by trypsinization for further analysis. 

To induce the differentiation of ES cells into myocytes and ultimately myotubes, ES 
cells are cultivated in hanging drops (800 cells/20ul) for two days. The developed embryoid 
bodies are cultivated in suspension for another three days, at which point single embryoid 
bodies are transfered to 24-well tissue culture plates to promote attachment. Differentiated 
20 myocytes can be isolated five days later by trypsinization. 

The SGE library is constructed from differentiated neuronal cells to isolate SGEs 
which inhibit the differentiation of ES cells into neurons. Than the variagated SGE library 
will be introduced into ES cells and the infected cells are differentiated into myocytes 
(subtractor cells). Than, differentiated myocytes are collected and the subtracted SGE library 

25 will be rescued. The subtracted SGE library is introduced again into ES cells and neuronal 
differentiation is induced. Differentiated neuronal cells are depleted using antibodies again 
neuronal cell specific surface markers and SGEs are rescued from the undifferentiated cells. 
This enriched SGE library will be introduced into ES cells again and the infected cells are 
induced to differentiate into neurons. SGEs will be rescued from undifferentiated cells and 

30 reintroduced into fresh ES cells. This cycle is repeated until -100% of the transfected ES cells 
which are induced to differentiate into neurons remain undifferentiated. These SGEs interfere 
with the function of genes whose activity plays an essential role in the neuronal differentiation 
of ES cells. 
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C. Selective killing of prion infec ted neuronal cells: 

Prions are infectious agents widely implicated in a variety of mammalian 
neurodegenerative diseases generally referred to as transmissible spongiform encephalopathies 

5 (for example Creutzfeldt-Jacob disease). In the brain of the infected patients there is plaque 
formation, the deposition of birefringent rods and fibrillar structures that destroy the host 
neurons and ultimately leads to death. The infectivity of prions is primarily associated with an 
aberrant conformation of a host protein, the prion protein, induced by the prion itself. The 
cellular prion protein is encoded by the cellular PrP gene and its normal function is unknown. 

10 Figure 8 depicts a therapeutic approach to isolate SGEs which selectively kill prion 

infected neurons. For this application the SGE library is constrcuted from prion infected 
neurons. This variagated SGE library is introduced into normal neurons (subtracter cells) in 
order to subtract the normal lethal SGEs from the library. The subtracted SGE library is than 
introduced into prion infected neurons (target cells) and dying cells are collected. The SGEs 

1 5 rescued from the dying cells are reintroduced into prion infected neurons and dying cells are 
collected. This cycle is continued until -100% of the prion infected cells die following the 
introduction of the enriched SGE library. These SGEs identify genes whose function is 
essential for the survival of prion infected cells but not essential for the survival of normal 
cells. 

20 

In addition to being a method for isolating SGEs that are themselves antiproliferative, 
as described above, the subject method can be carried out in a manner designed to identify 
SGEs which, rather than being acutely lethal themselves, confer an increased sensitivity to 
another agent, e.g., a cytotoxic agent. Identification of SGEs which selectively increase the 
25 sensitivity of a tumor cell to a chemotherapeutic agent is desirable. 

In an illustrative embodiment, the chemotherapeutic agent is added to target cell 
culture and, optionally, the subtractive cell culture. Upon additions of the chemotherapeutic 
agent to the subtractive cell culture, cells that are isolated are those that express an SGE and 
survive beyond, e.g. the T, /2 for the untransfected subtractive cells in the presence of the 
30 chemotherapeutic agent. Likewise, in the target cell culture, cells are isolated that are killed in 
some period of time less than the T 1/2 for the untransfected target cells. For example, isolating 
the cell populations which would undergo apoptosis with a T 1/2 of 1 or 2 c less than the T 1/2 
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for the untransfected cells should be enriched for SGEs which increase the sensitivity of the 
target cells for the chemotherapeutic agent. 



iv. Drug Screening Assays 

5 Another aspect of the invention is directed to the identification of agents capable of 

selectively modulating the phenotype of a cell, particularly the target cell or similar cells. For 
example, the subject method contemplates that agents can be developed which mimic the 
effect and selectivity of the identified SGE on the growth state of cells, e.g., of differentiation 
and proliferation, by targeting the endogenous gene or gene product (the "target gene" and 

10 "target gene product") which corresponds to the identified SGE. These agents include, but are 
not limited to, compounds that either potentiate or inhibit an intrinsic enzymatic activity of a 
target gene product or a complex including a target gene product, compounds that interfere 
with the interaction of a target gene product with other protein(s) or nucleic acid, and 
compounds comprising forms of a target gene product that are altered (mutated) to provide 

15 dominant loss-of-function or gain-of- function activity. The particular activity will depend on 
the characteristics of the target gene and/or target gene product identified in the subject 
method. 

Potential targets identified by the screen: 

1. Known "drugable" targets: known biochemical entities 

20 - immeditely "assayable": known enzyme/substrate pairs, appropriate for 

HTS 

- known chemistry 

2. Unknown but "drugable": target not known but sequence suggest biochemical 
activity 

25 - not immediately assayable/no known substrate 

- known chemistry 

3. "Non-drugable": no obvious biocemical activity 

- not "assayable" 

- no chemistry 

30 - can be used in gene therapy 
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In this regard, the present invention provides assays for identifying agents that are 
either agonists or antagonists of the normal cellular function of a target gene product, or of the 
role of that target gene product in the pathogenesis of normal or abnormal cellular function 
such as proliferation and/or differentiation, and disorders related thereto. Compounds 
5 identified by the present assay can be used, for example, in the treatment of such disorders. 

Agents to be tested for their ability to act as agonists or antagonists of a target gene 
product can be produced, for example, by bacteria, yeast or other organisms (e.g. natural 
products), produced chemically (e.g. small molecules, including peptidomimetics), or 
produced recombinantly. In a preferred embodiment, the test agent is a small organic molecule 
10 having a molecular weight of less than about 2,000 daltons. A high speed screen for agents 
that bind directly to the target gene product may employ immobilized or "tagged" 
combinatorial libraries (or libraries which otherwise readily deconvoluted) of, e.g., small 
organic molecules. 

Agents that are identified as active in the drug screening assay are candidates to be 

1 5 tested for their capacity to effect whole cells or tissue in vitro or in vivo. 

A variety of assay formats will suffice and, in light of the present disclosure, those not 
expressly described herein will nevertheless be comprehended by one of ordinary skill in the 
art. For instance, the assay can be generated in many different formats, and include assays 
based on cell-free systems, e.g. purified proteins or cell lysates, as well as cell-based assays 

20 which utilize intact cells. Simple binding assays can also be used to detect agents which, such 
as those which detect compounds able to potentiate or disrupt protein-protein or protein-DNA 
interaction involving a target gene product or target gene. 

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of compounds 

25 surveyed in a given period of time. Assays of the present invention that are performed in cell- 
free systems, such as may be derived with purified or semi-purified proteins or with lysates, 
are often preferred , as "primary" screens in that they can be generated to permit rapid 
development and relatively easy detection of an alteration in a molecular target that is 
mediated by a test compound. Moreover, the effects of cellular toxicity and/or bioavailability 

30 of the test compound can be generally ignored in the in vitro system, the assay instead being 
focused primarily on the effect of the drug on the molecular target as may be manifest in an 
alteration of binding affinity with other proteins or changes in enzymatic properties of the 
molecular target. 

-27- 



WO 00/53743 PCT/US00/06385 
Accordingly, in an exemplary screening assay of the present invention, a reaction 
mixture is generated to include a "target protein" (e.g., one form of a target gene product), test 
compound(s), and a "binding partner", e.g., a protein or nucleic acid which interacts with the 
target protein or that is a substrate of an enzymatic activity of the target protein. Detection and 
5 quantification of interaction, or substrate conversion (as appropriate) of the target protein with 
the binding partner provides a means for determining a compounds efficacy at inhibiting or 
potentiating interaction between the target protein and the binding partner. The efficacy of the 
compound can be assessed by generating dose response curves from data obtained using 
various concentrations of the test compound. Moreover, a control assay can also be performed 
10 to provide a baseline for comparison. In the control assay, interaction of the target protein and 
binding partner is quantitated in the absence of the test compound. 

Interaction between the target protein and the binding partner may be detected by a 
variety of techniques. Modulation of the formation of complexes can be quantitated using, for 
example, detectably labeled proteins such as radiolabeled, fluorescently labeled, or 
15 enzymatically labeled target proteins, by immunoassay, by chromatographic detection, or by 
detecting the intrinsic activity of the acetylase. 

Typically, it will be desirable to immobilize either the target protein or the binding 
partner to facilitate separation of complexes from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of target protein to the 

20 binding partner, in the presence and absence of a candidate agent, can be accomplished in any 
vessel suitable for containing the reactants. Examples include microtitre plates, test tubes, and 
micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which adds a 
domain that allows the protein to be bound to a matrix. For example, glutathione-S- 
transferase/target protein (GST/target protein) fusion proteins can be adsorbed onto glutathione 

25 sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, 
that are then combined with the cell lysates, e.g. an 35 S-labeled, and the test compound, and 
the mixture incubated under conditions conducive to complex formation, e.g. at physiological 
conditions for salt and pH, though slightly more stringent conditions may be desired. 
Following incubation, the beads are washed to remove any unbound label, and the matrix 

30 immobilized and radiolabel determined directly (e.g. beads placed in scintillant), or in the 
supernatant after the complexes are subsequently dissociated. Alternatively, the complexes can 
be dissociated from the matrix, separated by SDS-PAGE, and the level of binding partner 
found in the bead fraction quantitated from the gel using standard electrophoretic techniques. 
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Other techniques for immobilizing proteins and other molecules on matrices are also 
available for use in the subject assay. For instance, either the target protein or binding partner 
can be immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated 
target protein molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
5 techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with the target protein, but which do not interfere with the 
interaction between the target protein and binding partner, can be derivatized to the wells of 
the plate, and the target protein trapped in the wells by antibody conjugation. As above, 
10 preparations of an binding partner and a test compound are incubated in the target protein- 
presenting wells of the plate, and the amount of complex trapped in the well can be 
quantitated. Exemplary methods for detecting such complexes, in addition to those described 
above for the GST-immobilized complexes, include immunodetection of complexes using 
antibodies reactive with the binding partner, or that are reactive with the target protein and 
15 compete with the binding partner; as well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with the target protein or binding partner, either intrinsic or 
extrinsic activity. In the instance of the latter, the enzyme can be chemically conjugated or 
provided as a fusion protein with the target protein or binding partner. To illustrate, the 
binding partner can be chemically cross-linked or genetically fused (if it is a polypeptide) with 
20 horseradish peroxidase, and the amount of polypeptide trapped in the complex can be assessed 
with a chromogenic substrate of the enzyme, e.g. 3,3'-diamino-benzadine terahydrochloride or 
4-chloro-l-napthol. Likewise, a fusion protein comprising the polypeptide and glutathione-S- 
transferase can be provided, and complex formation quantitated by detecting the GST activity 
using l-chloro-2,4-dinitrobenzene (Habig et al (1974) J Biol Chem 249:7130). 
25 For processes which rely on immunodetection for quantitating proteins trapped in the 

complex, antibodies against the protein, such as anti-target protein antibodies, can be used. 
Alternatively, the protein to be detected in the complex can be "epitope tagged" in the form of 
a fusion protein which includes, in addition to the target protein sequence, a second 
polypeptide for which antibodies are readily available (e.g. from commercial sources). For 
30 instance, the GST fusion proteins described above can also be used for quantification of 
binding using antibodies against the GST moiety. Other useful epitope tags include myc- 
epitopes (e.g., see Ellison et al. (1991) J Biol Chem 266:21150-21157) which includes a 10- 
residue sequence from c-myc, as well as the pFLAG system (International Biotechnologies, 

Inc.) or the pEZZ-protein A system (Pharamacia, NJ). 
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An exemplary drug screening assay of the present invention includes the steps of (a) 
forming a reaction mixture including: (i) a binding partner, (ii) a target protein, and (iii) a test 
compound; and (b) detecting interaction of the binding partner and target protein. A 
statistically significant change (potentiation or inhibition) in the interaction of the target 
5 protein in the presence of the test compound, relative to the interaction in the absence of the 
test compound, indicates a potential agonist (mimetic or potentiator) or antagonist (inhibitor) 
for the test compound. The reaction mixture can be a cell-free protein preparation, e.g., a 
reconsistuted protein mixture or a cell lysate, or it can be a recombinant cell including a 
heterologous nucleic acid recombinantly expressing the target protein. 

10 Where the target protein is a receptor, or participates as part of an oligomeric receptor 

complex, e.g., which complex includes other protein subunits, the cell-free system can be, e.g., 
a cell membrane preparation, a reconstituted protein mixture, or a liposome reconstituting the 
receptor. For instance, the protein subunits of a receptor complex including the target protein 
can be purified from detergent extracts from both authentic and recombinant origins can be 

15 reconstituted in in artificial lipid vesicles (e.g. phosphatidylcholine liposomes) or in cell 
membrane-derived vesicles (see, for example, Bear et al. (1992) Cell 68:809-818; Newton et 
al. (1983) Biochemistry 22:6110-6117; and Reber et al. (1987) J Biol Chem 262:11369- 
11374). The lamellar structure and size of the resulting liposomes can be characterized using 
electron microscopy. External orientation of the receptor in the reconstituted membranes can 
20 be demonstrated, for example, by immunoelectron microscopy. The interaction of a ligand or 
test compound with liposomes containing such target protein complexes and liposomes 
without the protein can be compared in order to identify potential modulators of the receptor. 

In yet another embodiment, the drug screening assay is derived to include a whole cell 
expressing a target protein. The ability of a test agent to alter the activity of the target protein 

25 can be detected by analysis of the recombinant cell. For example, agonists and antagonists of 
the target protein biological activity can by detected by scoring for alterations in growth or 
differentiation (phenotype) of the cell General techniques for detecting each are well known, 
and will vaiy with respect to the source of the particular reagent cell utilized in any given 
assay. For the cell-based assays based on target genes identified in human cells, the 

30 recombinant cell is preferably a metazoan cell, e.g., a mammalian cell, e.g., an insect cell, e.g., 
a xenopus cell, and may be an adult cell, an embryonic cell or an oocyte. In other 
embodiments, where the target protein is a receptor, the receptor can be reconsituted in yeast 
or bacterial cells. 
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In addition to morphological studies, change(s) in the level of an intracellular second 
messenger responsive to activities dependent on the target protein can be detected. For 
example, in various embodiments the assay may assess the ability of test agent to cause 
changes in phophorylation patterns, adenylate cyclase activity (cAMP production), GTP 
5 hydrolysis, calcium mobilization, and/or phospholipid hydrolysis (IP 3 , DAG production). By 
detecting changes in intracellular signals, such as alterations in second messengers or gene 
expression, candidate agonists and antagonists to target protein-dependent signaling can be 
identified. 

Target proteins may regulate the activity of phospholipases. Inositol lipids can be 
10 extracted and analyzed using standard lipid extraction techniques. Water soluble derivatives of 
all three inositol lipids (IP h IP 2 , IP3) can also be quantitated using radiolabelling techniques or 

HPLC. 

The mobilization of intracellular calcium or the influx of calcium from outside the cell 
may be dependent on a target protein. Calcium flux in the reagent cell can be measured using 

15 standard techniques. The choice of the appropriate calcium indicator, fluorescent, 
bioluminescent, metallochromic, or Ca+^-sensitive microelectrodes depends on the cell type 
and the magnitude and time constant of the event under study (Borle (1990) Environ Health 
Perspect 84:45-56). As an exemplary method of Ca ++ detection, cells could be loaded with 
the Ca^+sensitive fluorescent dye fura-2 or indo-1, using standard methods, and any change in 

20 Ca"*" 1 " measured using a fluorometer. 

In certain embodiments of the assay, it may be desirable to screen for changes in 
cellular phosphorylation. The ability of compounds to modulate serine/threonine kinase or 
tyrosine kinase activation could be screened using colony immunoblotting (Lyons and Nelson 
(1984) PNAS 81:7426-7430) using antibodies against phosphorylated serine, threonine or 

25 tyrosine residues. Reagents for performing such assays are commercially available, for 
example, phosphoserine and phosphothreonine specific antibodies which measure increases in 
phosphorylation of those residues can be purchased from comercial sources. 

Certain of the target protein may set in motion a cascade involving the activation and 
inhibition of downstream effectors, the ultimate consequence of which is, in some instances, a 
30 detectable change in the transcription or translation of a gene. By selecting transcriptional 
regulatory sequences from such target genes, e.g., that are responsible for the up- or down- 
regulation of these genes, and operatively linking such promoters to a reporter gene, the 
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present invention provides a transcription based assay that is sensitive to the ability of a 
specific test compound to influence signalling pathways dependent on the target protein. 

In an exemplary embodiment, the -subject assay comprises detecting, in a cell-based 
assay, change(s) in the level of expression of a gene controlled by a transcriptional regulatory 
5 sequence responsive to signaling by a target protein. Reporter gene based assays of this 
invention measure the end stage of the above described cascade of events, e.g., transcriptional 
modulation. Accordingly, in practicing one embodiment of the assay, a reporter gene construct 
is inserted into the reagent cell in order to generate a detection signal dependent on signaling 
by the target protein. Expression of the reporter gene, thus, provides a valuable screening tool 
10 for the development of compounds that act as agonists or antagonists of target protein- 
dependent signalling. 

In practicing one embodiment of the assay, a reporter gene construct is inserted into the 
reagent cell in order to generate a detection signal dependent on second messengers generated 
by the target protein. Typically, the reporter gene construct will include a reporter gene in 

15 operative linkage with one or more transcriptional regulatory elements responsive to signal 
transduction from the target protein, with the level of expression of the reporter gene providing 
the detection signal. The amount of transcription from the reporter gene may be measured 
using any method known to those of skill in the art to be suitable. For example, mRNA 
expression from the reporter gene may be detected using RNAse protection or RNA-based 

20 PCR, or the protein product of the reporter gene may be identified by a characteristic stain or 
an intrinsic activity. The amount of expression from the reporter gene is then compared to the 
amount of expression in either the same cell in the absence of the test compound or it may be 
compared with the amount of transcription in a substantially identical cell that lacks the target 
receptor protein. Any statistically or otherwise significant difference in the amount of 

25 transcription indicates that the test compound has in some manner altered the inductive activity 
of the target protein. 

As described in above, in preferred embodiments the gene product of the reporter is 
detected by an intrinsic activity associated with that product. For instance, the reporter gene 
may encode a gene product that, by enzymatic activity, gives rise to a detection signal based 
30 on color, fluorescence, or luminescence. In other preferred embodiments, the reporter or 
marker gene provides a selective growth advantage, e.g., the reporter gene may enhance cell 
viability, relieve a cell nutritional requirement, and/or provide resistance to a drug. Many 
reporter genes are known to those of skill in the art and others may be identified or synthesized 
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by methods known to those of skill in the art. A reporter gene includes any gene that expresses 
a detectable gene product, that may be RNA or protein. 

In still another embodiment of a drug screening, a two hybrid assay can be generated 
with a target protein and binding partner. Drug dependent inhibition or potentiation of the 
5 interaction can be scored. The two hybrid assay formats described in the art can be readily 
adaoted for such drug screening embodiments. See, for example, U.S. Pat. Nos. 5,283,317, 
5,580,736 and 5,695,941; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol 
Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; and Iwabuchi et al. 
(1993) Oncogene 8:1693-1696) 
] 0 In addition to small molecules that may be identified, e.g., by the drug screening assays 

described above, other agents capable of modulating the activity of the target gene product 
may include peptide domains (fragments) of the target protein. A "mutant" as used herein 
refers to a peptide having an amino acid sequence which differs from that of the naturally 
occurring peptide or protein by at least one amino acid. Mutants may have the same biological 
15 and immunological activity as the naturally occurring protein. However, the biological or 
immunological activity of mutants may differ or be lacking. For example, a protein mutant 
may act as an agonist, antagonist (competitive or non-competitive), or partial agonist of the 
function of the naturally occurring protein. 

For example, homologs of the target proteins (both agonist and antagonist forms) can 
20 be generated using, for example, alanine scanning mutagenesis and the like (Ruf et al. (1994) 
Biochemistry 33:1565-1572; Wang et al. (1994) J. Biol. Chem. 269:3095-3099; Balint et al. 
(1993) Gene 137:109-118; Grodberg et al. (1993) Eur. J. Biochem. 218:597-601; Nagashima 
et al. (1993) J. Biol. Chem. 268:2888-2892; Lowman et al. (1991) Biochemistry 30:10832- 
10838; and Cunningham et al. (1989) Science 244:1081-1085), by linker scanning 
25 mutagenesis (Gustin et al. (1993) Virology 193:653-660; Brown et al. (1992) Mol. Cell Biol. 
12:2644-2652; McKnight et al. (1982) Science 232:316); by saturation mutagenesis (Meyers et 
al. (1986) Science 232:613); by PCR mutagenesis (Leung et al. (1989) Method Cell Mol Biol 
1:11-19); or by random mutagenesis (Miller et al. (1992) A Short Course in Bacterial 
Genetics, CSHL Press, Cold Spring Harbor, N.Y.; and Greener et al. (1994) Strategies in Mol 
30 Biol 7:32-34). Linker scanning matagenesis, particularly in a combinatorial setting, is on 
attractive method for identifying truncated (such as constitutively active or dominant 
negative) forms of a target protein. 

The invention also contemplates the reduction of the subject target protein to generate 
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mimetics, e.g. peptide or non-peptide agents, that are able intefere with., or mimic, the effect 
of the authentic target protein on the cells. Such peptidomimetics can act as drugs for the 
modulation of smooth cell differentiation. 

Peptidomimetics are commonly understood in the pharmaceutical industry to include 
5 non-peptide drugs having properties analogous to those of the mimicked peptide. The 
principles and practices of peptidomimetic design are known in the art and are described, for 
example, in Fauchere, Adv. Drug Res. 15:29 (1986); and Evans et al, J. Med. Chem. 30:1229 
(1987). Peptidomimetics which bear structural similarity to therapeutically useful peptides 
may be used to produce an equivalent therapeutic or prophylactic effect. Typically, such 

10 peptidomimetics have one or more peptide linkages optionally replaced by a linkage that may 
convert desirable properties such as resistance to chemical breakdown in vivo. These linkages 
may include -CH 2 NH-, -CH 2 S-, -CH 2 -CH 2 -, -CH-CH-, -COCH r , -CH(OH)CH 2 -, and - 
CH 2 SO-. Peptidomimetics may exhibit enhanced pharmacological properties (biological half 
life, absorption rates, etc.), different specificity, increased stability, production economies, 

1 5 lessened antigenicity and the like which makes their use as therapeutics particularly desirable. 

Such mutagenic techniques as described above are also particularly useful for mapping 
the determinants of a target proteins which participate in protein-protein interactions. To 
illustrate, the critical residues of a target protein that are involved in molecular recognition of 
other cellular proteins (or nucleic acid) can be determined and used to generate 

20 peptidomimetics which maintain at least a portion of that binding activity. By employing, for 
example, scanning mutagenesis to map the amino acid residues involved in binding, 
peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated 
which mimic those residues in binding to the kinase. For instance, non-hydrolyzable peptide 
analogs of such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in 

25 Peptides: Chemistry and Biology. G. R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 
1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and Biology . G. R. Marshall 
ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gama lactam rings (Garvey et 
al - in Peptides: C hemistry and Biology. G. R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) J. Med. Chem . 

30 29:295; and Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th 
American Peptide Symposium) Pierce Chemical Co. Rockland, 111., 1985), p-tum dipeptide 
cores (Nagai et al. (1985) Tetrahedron Lett 26:647: and Sato et al. (1986) J Chem Soc Perkin 
Trans_l:1231), and P-aminoalcohols (Gordon et al. (1985) Biochem Biophvs Res Commun 
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126:419; and Dann et al. (1986) ftiochem Bio phvs Res Commun 134:71). 

Modulation of the activity of a target gene according to the invention includes methods 
employing specific antisense polynucleotides complimentary to all or part of the nucleotide 
sequences encoding peptide domains comprising the target protein or antisense 
5 polynucleotides complimentary to all or part of the 3* or 5' noncoding regions of the target 
gene. Such complimentary antisense polynucleotides may include nucleotide additions, 
deletions, substitutions and transpositions, providing that specific hybridization to the target 
sequence persists. 

As used herein, "antisense" therapy refers to administration or in situ generation of 
10 oligonucleotide probes or their derivatives which specifically hybridize (e.g. bind) under 
cellular conditions with cellular mRNA and/or genomic DNA encoding a target protein. The 
hybridization should inhibit expression of that protein, e.g. by inhibiting transcription and/or 
translation. The binding may be by conventional base pair complementarity, or, for example, 
in the case of binding to DNA duplexes, through specific interactions in the major groove of 
15 the double helix. In general, "antisense" therapy refers to the range of techniques generally 
employed in the art, and includes any therapy which relies on specific binding to 
oligonucleotide sequences. 

Soluble antisense RNA or DNA oligonucleotides which can hybridize specifically to 
mRNA species encoding proteins comprising the molecular regulators, and which prevent 
20 transcription of the mRNA species and/or translation of the encoded polypeptide are 
contemplated as complimentary antisense polynucleotides according to the invention. 

An antisense construct of the present invention can be delivered, for example, as an 
expression plasmid which, when transcribed in the cell, produces RNA that is complementary 
to at least a unique portion of the target cellular mRNA. Alternatively, the antisense construct 

25 is an oligonucleotide probe that is generated ex vivo and which, when introduced into the cell 
causes inhibition of expression by hybridizing with the mRNA and/or genomic sequences of a 
target gene. Such oligonucleotide probes are preferably modified oligonucleotide that are 
resistant to endogenous nucleases, e.g. exonucleases and/or endonucleases, and is therefore 
stable in vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are 

30 phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S. 
Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, general approaches to 
constructing oligomers useful in antisense therapy have been reviewed, for example, by Van 
der Krol et al. (1988) Biotechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 
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2668. 

Several considerations should be taken into account when constructing antisense 
oligonucleotides for the use in the methods of the invention: (1) oligos should have a GC 
content of 50% or more; (2) avoid sequences with stretches of 3 or more G's; and (3) 
5 oligonucleotides should not be longer than 25-26 mers. When testing an antisense 
oligonucleotide, a mismatched control can be constructed. The controls can be generated by 
reversing the sequence order of the corresponding antisense oligonucleotide in order to 
conserve the same ratio of bases. 

Computer-aided molecular modeling of the target proteins can be used to study three- 
10 dimensional structures using computer visualization techniques. Novel designs of low 
molecular weight inhibitors or oligopeptides can then be analyzed for selective inhibition. 
Descriptions of targeted drug design can be found in Kuntz, "Structure-Based Strategies for 
Drug Design and Discovery," Science, 257:1078-1082 (1992) and Dixon, "Computer-Aided 
Drug Design: Getting the Best Results," Trends in Biotechnology. 10:157-^ (1992). 
15 Specific applications of the binding of inhibitors to targets using computer modeling have 
been described in Piper et al., "Studies Aided by Molecular Graphics of Effects of Structural 
Modifications on the Binding of Antifolate Inhibitors to Human Dihydrofolate Reductase," 
Proc. Am. Assoc. Canc er Res. Annual Meeting. 33:412 (1992); Hibert et al., "Receptor 3D- 
Models and Drug Design," Therapie fParis), 46:445-451 (1991)(serotonin receptor recognition 
20 sites). Computer programs that can be used to conduct three-dimensional molecular modeling 
are described in Klopman, "Multicase 1: A Hierarchical Computer Automated Structure 
Evaluation Program," Quantitative Structure- Activity Relationships. 11:176-184 (1992); 
Pastor et al., "The Edisdar Programs Rational Drug Series Design," Quantitative Structure- 
Activity Relationships 10:350-358 (1991); Bolis et al, "A Machine Learning Approach to 
25 Computer-Aided Molecular Design," J. Computer Aided Molecular Design. 5:617-628 (1991); 
and Lawrence and Davis, "CLIX: A Search Algorithm for Finding Novel Ligands Capable of 
Binding Proteins of Known Three-Dimensional Structure," Proteins Structure Functional 
Genetics. 12:31-41 (1992). 

In still other embodiments, low molecular weight inhibitors specific for the molecular 
30 regulators can be predicted by molecular modeling and synthesized by standard organic 
chemistry techniques. Computer modeling can identify oligopeptides which enhance the 
smooth muscle cell differentiation or block their dedifferentiation. Techniques for producing 
the identified oligopeptides are well known and can proceed by organic synthesis of amino 
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acids, by genetic engineering techniques, or by PCR based amplification. Silverman, The 
Orpanic Chemistry of Druf Tlesitm and D rug Action. Academic Press (1992). The inhibitors 
of this invention can be identified as those inhibitors that selectively inhibit the smooth muscle 
cell dedifferentiation. 

5 

v. Pharmaceutical Preparations of Id entified Asents 

After identifying certain test SGEs, or compounds identified from the drug screening 
assays descried above, the practitioner of the subject assay will continue to test the efficacy 
and specificity of the selected agents both in vitro and in vivo. Whether for subsequent in vivo 
10 testing, or for administration to an animal as an approved drug, SGEs and other compounds 
identified in the subject assays can be formulated in pharmaceutical preparations for in vivo 
administration to an animal, preferably a human. SGEs that are active as polypeptides can be 
turned into peptidomimetics. Likewise, antisense SGEs can be generated as non-hydrolizable 
analogs (e.g., resistant to nuclease degradation) and formulated for direct administration, or, as 
15 appropriate, provided in the form of an expression vector, such as for gene therapy, which 
produces the antisense molecule as a transcript. SGEs that are active as polypeptides can also 
be provided in the form of an expression vector for use, e.g., in gene therapy. 

The peptides, proteins and antisense selected in the subject assay, or gene therapy 
vectors encoding such molecules, may accordingly be formulated for administration with a 
20 biologically acceptable medium, such as water, buffered saline, polyol (for example, glycerol, 
propylene glycol, liquid polyethylene glycol and the like) or suitable mixtures thereof. The 
optimum concentration of the active ingredient(s) in the chosen medium can be determined 
empirically, according to procedures well known to medicinal chemists. As used herein, 
"biologically acceptable medium" includes any and all solvents, dispersion media, and the like 
25 that may be appropriate for the desired route of administration of the pharmaceutical 
preparation. The use of such media for pharmaceutical^ active substances is known in the art. 
Except insofar as any conventional media or agent is incompatible with the activity of the 
compound, its use in the pharmaceutical preparation of the invention is contemplated. Suitable 
vehicles and their formulation inclusive of other proteins are described, for example, in the 
30 book Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences. Mack 
Publishing Company, Easton, Pa., USA 1985). These vehicles include injectable "deposit 
formulations". Based on the above, such pharmaceutical formulations include, although not 
exclusively, solutions or freeze-dried powders of the compound in association with one or 
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more pharmaceutical^ acceptable vehicles or diluents, and contained in buffered media at a 
suitable pH and isosmotic with physiological fluids. In preferred embodiment, the SGE 
compound can be disposed in a sterile preparation for topical and/or systemic administration. 
In the case of freeze-dried preparations, supporting excipients such as, but not exclusively, 
5 mannitol or glycine may be used and appropriate buffered solutions of the desired volume will 
be provided so as to obtain adequate isotonic buffered solutions of the desired pH. Similar 
solutions may also be used for the pharmaceutical compositions of compounds in isotonic 
solutions of the desired volume and include, but not exclusively, the use of buffered saline 
solutions with phosphate or citrate at suitable concentrations so as to obtain at all times 
1 0 isotonic pharmaceutical preparations of the desired pH, (for example, neutral pH). 
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CLAIMS: 

1. A method for identifying agents which selectively confer a desired phenotype on a 
target cell, comprising: 

(i) transfecting subtractive cells with a library of expression vectors comprising a 
5 variegated population of coding sequences for potential synthetic genetic 

elements (SGEs); 

(ii) isolating, from the subtractive cells, those SGE vectors of the SGE library which 
do interfere with the detection of the desired phenotype in the target cell; 

(iii) transfecting target cells with the sub-population of SGE vectors isolated in step 
10 (ii); and 

(iv) isolating those SGE vectors which confer the desired phenotype to the target cell. 

2. A method for identifying agents with selectively antiproliferative activity for a target 
cell, comprising: 

15 (j) transfecting subtractive cells with a library of expression vectors comprising a 

variegated population of coding sequences for potential synthetic genetic 
elements (SGEs); 

(ii) isolating those SGE vectors of the SGE library that are not antiproliferative to the 
subtractive cells; 

20 (iii) transfecting target cells with the non-antiproliferative SGE vectors isolated in 

step (ii); and 

(iv) isolating those SGE vectors that are antiproliferative to the target cell. 

3. The method of claim 2, wherein the target and subtractive cells are eukaryotic cells. 
25 4. The method of claim 3, wherein the target and subtractive cells are mammalian cells. 

5. The method of claim 3, wherein at least one of the target and subtractive cells are 
human cells. 

6. The method of claim 2, wherein the target cell is a transformed cell, and subtractive 
cell is an untransformed cell. 
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7. The method of claim 2, wherein the target cell is infected by a virus, and subtractive 
cell is a cell not infected with the virus. 

8. The method of claim 2, wherein the expression vectors are viral vectors. 

9. The method of claim 8, wherein the viral vectors are retroviral vectors. 



10. 



The method of claim 2, wherein the SGE library is generated from a normalized cDNA 
library. 



11. The method of claim 2, wherein the SGE library is generated from a substrative cDNA 
library. 

12. The method of claim 2, wherein the SGE is a sense oriented sequence encoding a 
peptide. 

13. The method of claim 2, wherein the SGE is an antisense-oriented sequence encoding an 
antisense RNA. 

14. An isolated SGE identified according to the method of claim L 

15. A synthetic oligonucleotide having a nucleotide sequence from about 12 nucleotides to 
all of the nucleotide sequence of an antisense RNA encoded by an SGE identified 
according to the method of claim 13. 

16. An isolated peptide encoded by an SGE identified according to the method of claim 12. 

17. An peptidomimetic corresponding to a peptide encoded by an SGE identified according 
to the method of claim 12. 

1 8. The method of claim 1, comprising the further step of formulating, in a pharmaceutical 
preparation, one or more SGEs that are identified as lethal to the target cell. 

19. A method for identifying a compound having a selective biological effect on a target 
cell comprising, 

(i) providing a drug screening assay for detecting agents which inhibit or 
potentiate the function of a target gene or target gene product corresponding to 
a SGE identified by the method according to claim 1 ; and 

(ii) testing, in the drug screening assay, the ability of a test compound to modulate 
the function of the target gene or target gene product. 
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20. The method of claim 19, comprising the further step of formulating a pharmaceutical 
preparation comprising one or more agents identified as able to to modulate the 
function of the target gene or target gene product 

21. A method for modulating a phenotype of a target cell comprising contacting the target 
cells with an agent identified by the method of claiml9 able to modulate the function 
of the target gene or target gene product. 

22. The method of claim 2, wherein the antiproliferative activity is the result of cell death. 



15 
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Figure 1 
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Figure 2 

Construct a tumor specific SGE library 
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Figure 3 Isolation of disease specific SGEs 
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Figure 4 Isolation of SGEs which confer sensitivity to 

pesticides in plants 
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Figure 5 Isolation of SGEs which confer resistance to 

chemotherapeutic agents 
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Fiqure 6 Isolation of SGEs which inhibit NGF induced 

neuronal differentiation of PC1 2 cells 
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Figure 7 Isolation of SGEs which inhibit the neuronal 

differentiation of ES cells 
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Figure 8 Isolation of SGEs which selectively kill prion 
y infected cells 
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